Jiaqing Liang

Deep Research as Rubric for Reinforcement Learning

May 31, 2026

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

May 28, 2026

SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment

Apr 14, 2026

Structured Reasoning for Large Language Models

Jan 12, 2026

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Jan 10, 2026

ComLQ: Benchmarking Complex Logical Queries in Information Retrieval

Nov 15, 2025

Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following

Oct 16, 2025

HINT: Helping Ineffective Rollouts Navigate Towards Effectiveness

Oct 10, 2025

CultureScope: A Dimensional Lens for Probing Cultural Understanding in LLMs

Sep 19, 2025

GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems

Jun 04, 2025